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ABSTRACT 


This  report  summarizes  the  results  of  research  during 
the  past  two  years,  in  understanding  the  resolution  of  the 
autoregressive  (AR)  spectral  estimators  and  developing  and 
evaluating  computationally  efficient  autoregressive-moving 
average  (ARMA)  spectral  estimators.  The  loss  in  the  resolution 
of  the  AR  spectral  estimator  in  the  presence  of  noise  is  related 
to  the  appearance  of  zeros  in  the  z-plane.  A  parallel  resonator 
model  is  developed  to  relate  the  loss  in  resolution  (bandwidth 
expansion)  to  the  signal- to-noise  ratio  and  parameters  of  the 
noiseless  signal  model. 

A  new  technique  for  the  identif ication  of  the  order  of 
an  AR  model  was  derived  that  shows  substantial  stability  compared 
to  the  popular  Akaike  Information  Criterion  method.  Order 
determination  was  emphasized,  since  increase  in  the  order  of 
the  AR  spectral  estimator,  to  account  for  the  presence  of  noise, 
is  naturally  accompanied  by  larger  variance  of  the  estimates  and 
appearance  of  spurious  peaks. 

Several  sub-optimum  (non-maximum-likelihood)  ARMA  spectral 
estimators  were  also  developed.  These  methods  are  computationally 
efficient,  but  statistically  not  very  stable  for  small  data  records. 
An  evaluation  of  the  statistical  properties  of  the  different  sub¬ 
optimum  ARMA  techniques  led  to  the  evaluation  of  asymptotic  bounds 
on  the  variances  of  the  estimates  of  the  parameters  or  the  poles 
and  zeros  of  the  model  through  the  evaluation  of  Fisher's 
information  matrix.  Finally,  a  modification  of  Burg's  MEM  spectral 
estimator  was  developed  that  improves  the  accuracy  of  the  spectral 
estimates  of  complex  sinusoids  and  makes  the  method  considerably 


more  robust. 
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I .  INTRODUCTION 

The  problem  of  spectral  estimation  via  the  autoregressive 
(AR)  and  autoregressive-moving-average  (ARMA)  modeling  of  the 
observed  noisy  signal  is  treated  in  this  report.  High  resolution 
spectral  analysis  based  on  the  AR  model  has  received  much 
attention  in  recent  years  [  1  ]  -  [  3  ]  .  The  basic  impetus  for 

taking  these  approaches  is  due  to  the  fact  that  the  parametric 
modeling  schemes  are  data  adaptive  and  are  free  from  the  effects 
of  window  functions  that  inherent  in  the  traditional 

Blackman  and  Tukey  [  4  ]  type  spectral  estimators.  Furthermore, 
the  AR  model  is  easily  estimated  making  it  useful  in  applications 
such  as  radar  signal  processing  that  require  near- real- time 
processing . 

The  properties  of  the  AR  spectral  estimator  have  been 
studied,  theoretically  in  the  asymptotic  case  [  3  ]  ,  [  5  ]  ,  [  6  ] 
and  empirically  [  1  ]  ,  [  2  ]  .  It  has  been  shown  that  this 
estimator  in  many  cases  offers  considerably  higher  resolution 
based  on  the  same  amount  of  data,  than  the  Blackman  and  Tukey 
type  estimators.  Furthermore,  the  ‘bove  asymptotic  and  empirical 
investigations  have  shown  the  variance  of  the  AR  estimates  to 
be  comparable  to  the  unsmoothed  Blackman  and  Tukey  type  estimates, 
for  the  same  number  of  autocorrelation  lags.  It  should  be 
pointed  out,  however,  that  the  AR  estimates  usually  reouire  much 
fewer  lags  for  the  same  resolution. 

Most  practical  applications  of  spectral  analysis  involve 
noisy  signals  and/or  multiple  signal  and  noise  mixtures. 

Therefore,  it  is  natural  to  consider  the  performance  of  the  AR 
spectral  estimator  in  the  presence  of  noise.  Previous  studies 
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[  1  ]  ,  [  2  ]  had  shown  resolution  degradation  in  the  presence 

of  noise.  The  solution  presented  for  improving  resolution  was 
given  as  increasing  the  order  of  the  estimator ,  in  a  rather 
arbitrary  fashion . 

This  report  deals  with  the  question  of  AR  signals  in  noise. 

The  main  thrust  of  the  work  reported  here  was  to  improve  spectral 
resolution  by  using  ARMA  models  for  the  measured  signals. 
Specifically  some  suboptimum  but  computationally  efficient  ARMA 
spectral  estimation  algorithms  were  developed  and  their  properties 
studied.  In  the  process  of  the  investigations,  an  order 
determination  scheme  as  well  as  a  new  Burg  type  spectral 
estimator,  however,  were  also  developed  and  will  be  described  here. 

1.1  Report  Outline 

This  report  is  organized  as  follows.  First  the  AR  process 
is  defined.  The  sum  of  uncorrelated  AR  processes  and  white 
noise  is  then  considered  and  shown  to  be  represented  by  an 
equivalent  ARMA  process.  A  multiple  resonator  model  for  signals 
of  interest  is  presented  and  shown  to  be  equivalent  to  the  ARMA 
model.  Resolution  degradation  as  a  function  of  noise  and 
resonator  pole  locations  is  considered  and  from  it  some 
representative  curves  of  bandwidth  expansion  are  presented. 

Chapter  III.  of  the  report  deals  with  the  basic  question 
of  AR  model  order  determination.  A  new  criterion  related  to  that 
of  Akaike's  (AIC)  is  derived  and  comparative  examples  are  given. 

The  next  two  chapters  treat  the  basic  emphasis  of  this  work, 
namely  that  of  ARMA  spectral  estimation.  First,  several  ARMA 
schemes  are  presented  that  are  computationally  efficient  but  sub¬ 
optimum.  Sub-optimality  is  treated  with  respect  to  the  maximum- 
likelihood  (ML)  estimates  of  the  parameters.  The  subsequent 


chapter  then  discusses  some  statistical  properties  of  classes 
of  ARMA  spectral-es timator ,  with  respect  to  the  statistic  used 
in  the  estimation.  Here,  again,  the  reference  scheme  is  the 
ML  estimator.  The  final  chapter  of  the  report  deals  with  a 
new  scheme,  related  to  Burg's  MEM  method,  which  is  especially 
useful  for  the  common  case  of  sinusoidal  signals  in  noise.  This 
method  is  shown  to  be  more  robust  than  the  MEM  technique,  with 
better  accuracy  and  equivalent  or  superior  resolution. 
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II.  AUTOREGRESSIVE  SPECTRAL 
ESTIMATION  OF  NOISY  SIGNALS 


The  most  popular  recent  data-adapti ve  spectral  estimation 
method  is  one  based  on  an  all-pole  model  for  the  siqnal.  The 
algorithmic  approach  for  the  estimation  of  the  parameters  of 
such  a  system  include,  the  methods  of  fitting  the  autoregressive 
(AR)  coefficients  to  the  data  as  well  as  the  popular  Burg  maximum 
entropy  method  MEM.  Because  of  the  popularity  of  this  model 
and  computational  simplicity  of  its  estimates,  the  noisy  signal 
spectral  estimation  will  be  confined  to  autoregressive  signal 
models.  That  is,  it  will  be  assumed  that  in  the  absence  of 
noise  and  interference  an  AR  model  satisfactorily  describes 
the  signal.  Resolution  dearadation  of  the  AR  spectral  estimates 
in  the  presence  of  noise  and  interference  is  then  investigated 
and  related  to  the  changes  in  the  model  structure. 


II. 1  The  AR  Spectral  Estimator 

x.  U 

A  zero-mean  time  series  {x*_}  is  said  to  satisfy  an  Lr  order 
autoregressive  model  if: 


x 


t 


X 


t-i 


+ 


(II. 1) 


where  {a^}  denote  the  AR  coefficients  and  {u^}  is  a  zero-mean 
uncorrelated  (white)  sequence.  Another  interpretation  of  the 
model  in  (II. 1)  is  that  {ou}  represent  an  order  one-step  ahead 

linear  predictor  of  {x^,}.  If  {a^}  are  then  estimated,  based  on 
a  minimum  mean  square  error  criterion  {u^}  on  the  average  becomes 
an  orthogonal  sequence.  It  can  be  shown  [2  ]  that  the  model  in 
(II. 1)  leads  to  a  spectral  density  of  the  form 
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where  is  the  spectral  level  of  the  {u^}  sequence. 

The  spectral  density  shown  in  (II. 2)  is  the  AR  spectrum 
and  it  is  this  model  that  is  fitted  to  an  observed  time  series, 
by  simply  estimating  {ou}  from  the  time  series.  Several  estimation 
procedures  for  have  been  discussed  in  the  literature  such 

as  the  maximum  likelihood,  the  least-squares  [7]  and  Burg's 
method  based  on  forward  and  backward  prediction  error  filtering 
[8].  If  the  number  of  data  samples  is  not  very  small, 
the  simplest  and  computa tional ly  most  efficient  estimates  of 
{a^}  are  the  solution  of  the  Yule-Walker  equations.  These 
equations  arise,  simply  by  multiplying  equation  (II. 1)  by 
^ ,  i=l,...L  and  taking  the  expectation  of  both  sides,  to 
obtain  a  relation  between  the  autocorrelation  function  of  the 
process  {x„}  and  the  coefficients  {a^}.  The  Yule-Walker  equations 
are  then  given  by: 


R  A  =  pn  (II. 3) 

o 
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and  where  is  the  autocorrela tion  function  of  {xt}  at  the  i  lag. 
Furthermore,  the  power  in  {u^.}  can  be  found  by  multiplying  (II.  1) 


by  x^  and  taking  the  expectation  as: 


(II. 4) 


S1  ‘  (ro 


£  a.r.JAT  ,  AT  the  sampling  interval 
i  =  l  11 


In  practice  {ou  }  and  are  estimated  from  (II. 3)  and  (II. 4)  based 
on  estimates  of  the  autocorrelation  function  {r^}. 

II. 2  Spectral  Estimation  in  the  Presence  of  Noise  and  Interference 


We  now  assume  that  the  signal  {xfc}  satisfies  an  L  ^  order 
AR  model  and  therefore  its  spectrum  can  be  estimated  as  in  the 
previous  section.  The  problem  of  interest  is  the  estimation  of 
the  spectrum  of  the  observed  signal 


Yt  =  xt  +  u)t  +  nfc 


(II. 5) 


where  {w  }  is  an  AR  (M)  process  and  considered  to  be  the  inter¬ 
ference  and  {nfc}  is  a  white  noise  sequence,  with  (nt),  {xtl  anc^ 
{wt}  mutually  uncorrelated. 

The  resolution  of  the  spectral  estimators  are  now  discussed  in 
the  asymptotic  case,  that  is,  when  the  autocorrelation  function 
of  {yt }  is  accurately  known. 

Let  { }  be  described  by  (II. 1)  and  be  given  by  the 

following  autoregressive  model 


uJ  =  i.  b.  co.  .tv, 
t  ,  l  t-i  t 

1  =  1 


The  z-spectrum  of  y^  is  then  given  by: 


S  (z)  =  - — pm 

D  (z)  D  (z  ] 

X  X 


(II. 6) 


D  { z )  D  (z  ") 

CO  a) 


where  S ^  and  are  given  by  (II. 4)  using  the  appropriate  auto¬ 
correlation  values  for  (xt)  and  {wt},  Sn  is  the  spectrum  of  nt  and 
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L  M 

D  ( z )  =  1  -  Z  a. z1  ,  D  (z)  =  1  -  I  b.z1 
x  i=l  1  ^  i=l  1 

Putting  (11,6)  under  a  common  denominator,  it  becomes  obvious 
that  SY(z)  is  the  spectrum  of  an  autoregressive  moving  average 
process  of  orders  L+M  and  L+M,  i.e.,  AR ( L+M) /MA ( L+M) .  This  is 
a  process  with  L+M  zeros  and  L+M  poles,  where,  from  equation 
(II. 6),  the  numerator  polynomial  coefficients  are  related  to 
{a^},  {b^}  and  Sn  in  an  obvious  manner. 

It  can  be  seen  that  the  estimation  of  S^{z)  using  a  purely 
AR  technique  (all-pole)  is  equivalent  to  approximating  the  (L+M) 
order  moving  average  component  by  an  AR  one.  This,  theoretically 
would  require  an  infinite  order  model.  Finite  order  models  of 
order  L,  however,  will  estimate  Sy(f)  with  good  resolution,  with 
L  depending  on  the  various  parameters  of  signal  noise  and  inter¬ 
ference,  notably  their  relative  power. 

It  is  obvious  now  that  whereas  the  resolution  of  Blackman 
and  Tukey  type  spectral  estimations  are  only  determined  by  the 
window  bandwidths  in  a  predictable  fashion,  those  of  the  AR  and 
generally  ARMA  estimators  are  very  much  data  dependent,  requirina 
larger  all-pole  orders  or  ARMA  modeling.  Figure  (II.  1)  shows  the 
spectrum  of  a  noiseless  AR  signal.  Figure  (II.  2)  shows  the 
calculated  AR  (from  exact  values  of  the  autocorrelation  function) 
spectrum  of  the  same  signal  in  the  presence  of  white  noise,  using 
order  L  =  20.  The  degradation  of  spectral  resolution  is  obvious. 
A  different  approach  to  the  demonstration  of  the  dependence  of 
spectral  resolution  on  the  signal- to-noise  ratio  is  to  consider  a 
parallel  resonator  model  for  the  measured  signal.  This  is 
discussed  in  the  next  section. 


A 
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II. 3  Parallel  Resonator  Model 

High  resolution  spectral  estimation  is  normally  used  in 
situations  where  the  signal  has  a  "peaky"  spectrum.  Therefore, 
one  may  postulate  the  signal  model  as  the  sum  of  the  outputs  of 
M  second-order  resonators  driven  by  white  noise.  This  model, 
as  will  be  shown  in  the  sequel,  lends  itself  to  an  investigation 
of  the  resolution  degradation  or  bandwidth  expansion  as  a  function 
of  measurement  noise. 

Mathematically,  the  signal  s(n)  is  modeled  as  [  9  ], 

M 

s{n)  »  I  s  (n) 
m=l 

where 

sm(n)  “  amlsm(n"1)  +  am2sm(n'2)  +  am0u(n)  (II'7) 

where  u(n)  is  the  driving  noise  of  the  signal  generation  process. 

2 

It  is  assumed  that  u(n)  is  white  with  variance  a  .  The  signal 
model  is  shown  in  figure  (II. 3). 

The  transfer  function  of  the  signal  generation  process 
is  defined  as 


H(z) 


S(z) 

uTST 


(II. 8) 


where  S(z)  and  U(2)  denote  the  z-transforms  of  the  signal  s(n) 

and  the  noise  u(n)  respectively.  The  transfer  function  H  (z)  of 

m 

the  m’th  resonator  is  obtained  by  taking  the  z-transform  of  s  (n) 

^  m 

- 1  -2 

S  f  z )  =  a  ,  z  S  ( z )  +  a  ~  z  S  ( z )  +  a  A  rJ  ( z ) 
m  ml  m  y  m2  m  m  0 


Hence , 


m 


Sm(z) 

U  (  2  ) 


am0 


-  a 


ml 


-1 

z 


m2 


-2  * 
z 


1 


(II. 9) 
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The  z-transform  of  the  signal  s (n)  is  then 


S(z)  =  £  S  (2) 

m=l 


=  l  U(z)Hm(z) 
m=  1 


We  can,  therefore,  express  the  transfer  function  for  the 


generation  of  s(n)  as: 


H(z)  = 


MM  12 

,  “  ak0  nn  (1  ‘  amlZ~  "  am2z”  > 
k=i  m=l 


- 1  -  2 

TI  ( 1  ~  a  ,  z  -  a  0z  ) 
,  ml  m2 

m=l 


(II. 10) 


It  can  easily  be  seen  that  the  order  of  the  numerator  polynomial 
N(z)  of  the  transfer  function  H(z)  is  2M-2  while  the  order  of 
the  denominator  polynomial  is  2M.  H(z)  is  also  recognized  as 
the  transfer  function  of  an  autoregressive-moving  average  (ARMA) 
model.  Therefore,  using  the  parallel  resonator  model  is  the  same 
as  using  an  ARMA  model  where  the  difference  in  order  between  the 
autoregressive  (AR)  and  moving  average  (MA)  terms  is  two. 

Thus,  in  the  absence  of  noise,  s (n)  can  accurately  be  modeled 
as  an  ARMA  process  (or  the  output  of  a  pole-zero  filter).  If  s(n) 
is  corrupted  by  the  uncorrelated  white  noise  sequence  w(n),  the 
z-power  spectrum  of  the  observation,  y(n),  is  given  by 


S,(Z)  » 


_  |  N  (  z  ) 


!  D  (zT  j  °U  +  °w 

|H(Z) I2  +  o2|d(z) I2 


therefore,  in  the  presence  of  noise,  the  model  of  y(n)  is 


equivalent  to  an  ARMA  (2M(  2M)  process.  This  of  course  is 
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consistent  with  the  model  derived  in  the  preceding  section 
directly  from  the  addition  of  AR  processes  and  white  noise. 


II. 4  Analysis  of  a  Single  Resonator 

Some  insight  into  the  behavior  of  the  parallel  resonator 
model  in  the  presence  of  noise  can  be  gained  by  examining  the 
behavior  of  each  resonator  individually. 

To  simplify  the  analysis /  we  assume  that  the  noise  w(n)  is 
added  equally  to  the  output  of  each  resonator.  In  the  following 
first  the  set  of  conditions  which  the  parameters  of  each  resonator 
must  satisfy  for  stability  and  resonance  are  discussed.  The 
relations  between  the  parameters  and  the  location  of  the  poles 
of  a  resonator  are  then  derived.  The  effect  of  additive  noise 
on  the  poles  is  then  examined.  Finally,  the  perturbations  of 
the  pole  positions  of  the  noisy  resonator  as  a  function  of  the 
signal- to- noise  ratio  is  analyzed. 

The  transfer  function  of  the  m'th  resonator  is  given  by 


Vz»  - 


V2' 


1  -  a  .  z 
ml 


A(z) 

m 


(II. 11) 


a  ~  z 
m2 


The  poles  of  H  (z)  must  be  complex  conjugates  for  S  (n)  to  be  a 
m  ^  m 

narrow  uand  process.  The  locations  of  the  poles  are  the  solution 
of  the  quadratic  equation 


2*  ,  x  2 
z  Am(z)  =  z 


a  ,  z  -  a  ^  =  0 
ml  m2 


(11.12) 


or 


14 


i 


z 


aml  +  4am2 

2 


the  condition  for  having  complex  conjugate  poles  is  that 
the  discriminant  must  always  be  negative.  That  is. 


or 


a  ,  +  4a  ~ 
ml  m2 


0 


(11.13) 


If  equation  (11.13)  is  satisfied,  then  the  poles  would  be  in 
polar  coordinates , 

J  m 

z  =  p  e 
m  m 

where  is  the  distance  from  the  origin  of  the  unit  circle  to 
location  of  the  pole  and  is  the  frequency  of  resonance  in 
radians/ second. 

.  * 

The  system  is  stable  if  z  and  z  are  located  within  the 

m  m 

unit  circle.  The  conditions  for  stability  are  given  by  the 
Jury  test 


A  ( 1 )  >  0 
m 

V*11  >  0 

and  1  >  I  a  ~ 

j  m2 

Therefore,  the  parameters  must  satisfy 


aml  +  am2  s'  1 
aml  “  am2  >  _1 
!  am2 


(11.14) 


and 


<  1 
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When  equations  (11.13)  and  (11.14)  are  satisfied,  the  impulse 
response  h^(n)  is  easily  shown  to  be  [10] 


pn  sin(n+l)w 
m  m 

mO  sin  d 

m 

★ 

Upon  substituting  for  and  in  equation  (II. 12)  and  equating 

★ 

A  (z  )  and  A  (z  ) ,  it  follows  easily  that 
mv  nr  mm'  2 


S  ,  ~  2r  COSW 

ml  m  m 

(11.15) 

2 

a  n  =  -r 
m2  m 


It  has  already  been  shown  that,  in  the  absence  of  noise, 
the  m'th  resonator  has  2  complex  conjugate  poles  located  within 
the  unit  circle  and  2  zeros  located  at  the  origin  of  the  unit 
circle  (equation  11,11).  The  effect  of  the  addition  of  the  white 
noise  is  to  move  the  zeros  from  the  origin  towards  the  poles. 

When  the  noise  dominates  the  signal,  the  zeros  cancel  the  poles 
resulting  in  a  flat  spectrum. 

If  we  let  Ym(n)  be  the  output  of  the  m’th  noisy  resonator,  then 


Vn) 


sm(n) 


w  (n) 
M 


=  sm(n)  +  (n) 


(11.16) 


where  w1 (n)  is  white  with  variance 

2 


o 


w  1 


"hen,  the  power  spectrum  of  y  (n)  is 

m 
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sy  <2>  *  °u  Hm<2>  2  +  °l' 
1  m 


2  2 
amOau 

i  +  * 

A  ( 2 )  A  ( z  X) 
m  m 


amO°u  +  gw'Am(z)Am(z~1} 
Am(z)Am(2"1) 


If  we  let 


<’oV*>V*'1>  *  amo"u  +  82'V*>V*‘1> 


(11.17) 


we  see  that  y  (n)  may  be  modeled  as  the  output  of  a  rational 

transfer  function  with  B  (2)  as  the  numerator  and  A  (z)  as  the 

m  m 

denominator  polynomials  with  o(n)  as  a  white  input.  Then 
- 1  -2 

B  (z)  =  1  -  b  -  b  ~z  and  n(n)  is  a  zero  mean  white  sequence 

m  ml  m2 

2 

with  variance  a  .  It  is  known  [11]  that  a  B^(z)  exists  which  has 
its  roots  on  or  within  the  unit  circle.  Equation  (11.17)  can  be 
written  as 

°n(1  -  bmlz~ 2  -  bm2z‘2>(1  -  bml2  *  * 


(11.18) 


amO°u  +  V  (1  -  ^l2"1  ‘  am2z_2)  (1  “  amlz  “  am2z2) 

Upon  expanding  the  above  equation  and  equating  terms  of  equal 
degree  on  both  sides  we  get 


22  2  , ,  2  2  x  2  ..  L2  ,2, 
araO°u  +  V (1  +  aml  +  am2  "  V1  +  bml  +  bm2> 

aml°w 1  *ajn2  '  11  '  bml%lbm2  '  11 

2  _  2, 
a  « 0  ■  o  b 
m2  w  n  m2 


fix. 19)  (b) 


Equations  (11.19)  (b)  and  (c)  imply 


b 


ml 


aml(am2 
am2 (bm2  ' 


1 )  b 


m2 


1) 


(11.20) 


Equations  (11.19)  (a)  and  (c)  imply 


2  2  2  . ..  2 

a  Ao  +  a  , (1  +  a  , 
mu  u  w  ml 


+  a 


m2  ; 


1  +  b 


ml 


+  D 


m2 


a  ~o  , 
m2  w 

m2 


(11.21) 


or, 


*  *  »  *  4  *  am2> 

1  +  b2.  +  b2 

ml  m2 


m2 

’m2 


where 


X 


am0° 
2 

w’ 


(11.22) 


2 

If  the  noise  variance  o  ,  is  small,  then  X  >>  1.  Equation  (11.22) 

am2 

imolies  therefore  that  r — *  >>  1  or  a  0  >>  b  0  in  which  case 

bm2  m2  m2 

equation  (11.20)  shows  that  b  ^  +  0.  Then,  for  hiqh  signal- 
to-noise  ratios,  the  zeros  are  located  near  the  origin  and  their 
presence  does  not  affect  the  spectrum  seriously.  The  exact 
locations  of  the  zeros  is  found  by  solving  equations  (11.20)  and 
(11.19)  simultaneously. 

II. 5  Perturbations  of  Pole  Positions 

The  addition  of  white  noi^  to  a  process  given  by  a  resonator 
mocel  tends  to  move  the  power  spectral  density  peak  and  to  broaden 
the  bandwidth.  It  has  been  seen  in  the  previous  section  that  this 
is  caused  by  the  zeros  being  shifted  from  the  origin  towards  the 
poles.  If  we  now  model  the  observation  again  by  an  all-pole  second- 
order  resonator  model,  the  positions  of  the  poles  will  have  changed. 


It  is  this  perturbation  which  causes  an  expansion  in  the 
estimated  bandwidth.  In  the  following  this  effect  is  examined. 
The  output  of  the  m'th  resonator  is 


Ym  (n)  =  s  (n)  +  w*  (n) 


where  s  (n)  is  a  second-order  process. 


s  (n)  =  a  ,s  (n-1)  +  a  s  (n-2)  +  a  nu(n) 
m  ml  m  m2m  mO 


Ym(n)  may  also  be  modeled  as  a  second-order  process, 
ym(n)  =  amlymin-l)  +  am2ym(n-2)  +  amOu(n)‘ 

l  l  I 

In  the  absence  of  noise,  a  a  0  and  a  ^  are  identical  to 

ml  m2  mO 

aml /  am2  and  am0  respectively.  The  autocorrelation  function 

r  (n)  of  the  process  y  (n)  has  been  shown  to  be  given  by  [10], 
ym 

ry  (n)  =  am0°u  plSlacos(!nl%  '  V  +  °w’5(n) 

1  m 


where , 


1  +  (- 


1  -  P 


1  +  P4W 


1  +  p 


2  ,2 

cot  U) 

2  m 


2  2  4 

l-o  1  -  2o  cos2u)  +  p 
m  m  m 


(11.23) 


=  arctan 


1  +  r 


m 

-*■  COtw  . 
2  m 


;il .24) 


and,  p  and  u>  are  related  to  the  parameters  a  .  ,  a  ^  by 
mm  ml  m2 


p  =  /-a  ~ 
m  m2 


U>  = 

m 


2vr-r 


arccos 


<rrr  ) 
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The  distance 

pm 

from 

the  origin 

of 

the 

unit 

circle  to 

the 

location  of 

the 

pole 

is  related 

to 

the 

3  dB 

bandwidth 

B  ’in 
m 

Hz  of  the  m* 

th 

resonator  by  [  9  ] 

then , 


P 

P 


m 

m 


e 


e 


2 


B 

m 

2 


where  B  is  the  3  dB  bandwidth  in  Hz  of  the  noisy  resonator, 
m 

Define  the  bandwidth  expansion  factor  (BEF)  as 


BEF 


B 

m 


1  + 


In(^) 

in  p 

m 


(11.28) 


Equation  (11.27)  shows  that  the  BEF  depends  on  the  signal- to- 
noise  ratio  (SNR),  £  ,  and  wm.  Families  of  curves  of  the  BEF 
as  a  function  of  the  SNR  in  the  range  -30  dB  to  +30  dB  have  been 
generated  for  3  different  frequencies,  .125  Hz,  .166  HZ  and  .25  Hz. 
They  are  shown  in  figures  (II. 4),  (II. 5)  and  (II. 6)  respectively. 

It  is  seen  from  the  graphs  that,  for  the  same  SNR,  the  BEF  is  an 
increasing  function  of  p  and  a  decreasing  function  of  u  .  For 
small  Pmfs,  the  resonator  has  a  large  bandwidth  and  adding  white 
noise  (white  noise  has  flat  spectrum  between  -.5  Hz  and  +.5  Hz) 
does  not  affect  the  spectrum  as  much  as  for  large  Pm's. 

The  above  discussion  demons  crates  clearly  that  the  asymptotic 


resolution  of  an  all-pole  model  depends  in  a  non-linear  fashion 
on  both  the  bandwidth  and  frequency  of  the  noiseless  signal . 


Therefore,  in  general  an  increase  in  the  model  order  does  not 
uniformly  improve  the  spectral  estimates  of  all  the  spectral 
peaks  and  the  improvement  is  rather  unpredictable.  Curves  such 
as  given  in  figures  (II. 4)  -  (II. 6)  may,  however,  be  used  as 
a  general  guideline  for  the  amount  of  degradation  in  a  given 
application . 
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FIGURE  H.6  BANDWIDTH  EXPANSION  FACTOR  VERSUS  SIGNAL  -  TO- NOISE  RATIO 
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III.  ORDER  DETERMINATION  FOR  AR  MODELS 


The  previous  chapter  demonstrated  the  need  for  increased 
order  of  an  AR  spectral  estimator  to  improve  resolution  in  the 
presence  of  noise,  resulting  in  a  larger  variance  for  the 
estimates.  In  fact,  the  trade-off  between  the  bias  (or 
resolution)  and  variance  of  a  spectral  estimator  is  the  central 
issue  in  spectral  estimation  by  any  method.  For  the  traditional 
(Blackman  and  Tukey  type)  spectral  estimators,  this  trade-off 
is  reflected  in  the  choice  of  the  spectral  window  type  and  the 
maximum  lag  of  autocorrelation  function  used.  This  subject, 
referred  to  as  window  carpentering,  is  discussed  in  detail  by 
Jenkins  and  Watts  [12],  and  is  straightforward  because  resolution 
is  well-defined  in  terms  of  the  spectral  window  bandwidth. 

With  the  popularity  of  data  adaptive  (notably  the  auto¬ 
regressive  (AR) )  spectral  estimation  methods,  similar  resolution- 
variance  trade-offs  are  in  order.  Specifically,  well-defined 
methods  are  needed  to  determine  the  order  of  the  (AR)  spectral 
estimator  for  a  given  data  sample.  Furthermore  for  practical 
applications,  these  methods  need  to  be  on-line  and  as  much  as 
possible  objective  in  nature.  This  problem  is  complicated,  however, 
due  to  the  data  dependent  nature  of  the  resolution  of  the  AR  spectral 
estimator  as  shown  before  (e.g.,  no  well-defined  window  bandwidth). 
Therefore,  the  question  of  order  determination  for  the  spectral 
estimator  seems  to  be  best  posed  as  a  procedure  for  obtaining  a 
compromise  between  the  AR  model  fit  and  the  variance  of  the 
estimated  AR  parameters  as  a  function  of  the  model  order. 

Akaike  [13,14]  and  Parzen  [15]  have  recently  introduced  some 
methods  for  automatic  de termination  of  orders  of  autoregressive 
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processes.  One  method,  based  on  Akaike' s  Information  Criterion 
(AIC),  has  gained  special  popularity.  In  this  report,  we  follow 
the  derivations  on  which  AIC  is  based,  introduce  appropriate 
modi f ications  to  account  for  practical  estimation  procedures  and 
derive  a  new  information  criterion  designated  the  Conditional 
AIC  ( CAIC ) .  We  then  present  the  results  of  a  number  of  numerical 
simulations  that  compare  the  performances  of  AIC  and  CAIC  for 
moder  order  identification  and  spectral  estimation. 

III.l  Akaike's  Information  Criterion 

Akaike  derived  his  information  criterion,  AIC,  as  an 
estimate  of  the  asymptotic  relative  goodness  of  fit  of  the  model 
to  the  observation.  Although  his  derivations  were  based  on 
information  theoretic  arguments,  the  resulting  parameters  were 
the  same  as  the  maximum  likelihood  estimates.  In  this  section 
we  review  the  steps  involved  in  obtaining  AIC  as  they  pertain 
to  the  derivation  of  the  new  criterion.  We  assume  the  time  series 
to  be  described  by 
L 

x  =  Z  a . x.  .  +  u. ,  t=0 , . . . N  (III.l) 

t  i=1  i  z 


where  u^  is  zero-mean  white  and  Gaussian  and  L  is  to  be  determined. 
Through  asymptotic  arguments,  Akaike  defines  an  information 
criterion,  related  to  the  maximum  likelihood  of  the  estimates  of 
ai  ,  aj,,  as: 


AIC (A) 
+  E  N 

JO 


(-2) In  (maximum  likelihood) 


A  -  A| 


2 


(III . 2) 


rr 
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where  denotes  asymptotic  expectation,  A  and  A  are  L  x  1  vectors 

/\ 

of  the  coefficients  ai  and  their  estimates  ai .  The  practical 
AIC  which  is  related  to  the  full-information  likelihood  function 
of  a  Gaussian  process  is  then  given  by 

AIC(L)  =  N  In  (MLE  of  innovation  variance)  +  2L  (III. 3) 
and  the  order  L  is  chosen  that  minimizes  AIC(L). 


III. 2  The  New  Criterion 

Since  the  exact  maximum  likelihood  (full  information 
maximum-likelihood)  estimates  are  generally  not  available,  the 
conditional  MLE,  one  based  on  Yule-Walker  equations  or  Burg's 
algorithm,  of  the  innovation  variance  are  normally  used  in  (III. 3). 
We  propose  using  the  conditional  maximum  likelihood  (CML)  function 
in  (III. 2).  This  function  is  based  exactly  on  the  available  data 
and  we  believe  is  a  more  sensitive  indicator  of  the  behavior  of 
the  estimates  used  in  practice.  Thus,  in  the  following,  the  CML 
estimate  of  A  and  its  covariance  function  are  considered,  in  order 


to  obtain  tractable  expressions  for  (III. 2). 

The  conditional  (partial  information)  likelihood  function 
for  the  time  series  in  ( 1 1 1 . 1 )  is  given  by: 


L ( A , ,^1* " * ' 


T 

1  -CDC 

___  axp  \ 

-  -r  2ou  ; 


(III  .4' 
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Furthermore,  the  CML  estimate  of  is  given  by: 

3  2  =  CTDC/(N-L) 
u  -  — 


(III. 6) 


and  a  lower  bound  for  the  variance  of  the  estimates  of  a^  follows 
from  the  Fisher's  information  matrix  to  be 


var  [a.  ]  >  i  a  2Ai:L 
-  N-L  u 


(III. 7) 


where  A1  is  the  diagonal  element  of  the  inverse  of  the  (L  +  1)- 


sample  covariance  matrix  of  xfc.  It  is  now  shown  that 

a  2 A11  =  1 
u  - 

2 

The  innovations  variance ,  ,  is  given  by  the  Yule-Walker 

relation  as 


a  2  =  r  =  [r, . . .rT ] AT^ 
U  o  1  LJ  -L 


where  A^+^  is  the  L+l-st  order  covariance  matrix  given  by: 

(ro  rL  1 


rr  .  .  .  .  r 
L  o 


The  first  diagonal  element  of  the  inverse  of  Al+-j_  ,  A  is  then 
given  by : 


(III. 8) 


(III. 9) 


(III . 10) 


Formula  (III. 9)  can  be  rewritten  as 
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ro!ALl  -  [r1...rL]SL 


2 

°u  = 


I  At 


where  ST  is  the  adjoint  of  Ar ,  given  by 

L/  1j 


(hi. id 


-  L  = 


'M1  • 


-p- 


i  nl'l 
(-1)  U 


.  ,  x  L-l 
(-1)  u 


L-l 


L-l 


and  p.  =  Minor  of  element  r •  in  AT . 

1  1 

Comparing  (III. 10)  and  (III.  11)  it  is  obvious  that  we  need  to  show 


=  Numerator  in  (III.il) 


From  the  form  of  AL+1  we  have 


IW  -  rol-AJ  +  (-X>rl  ,E,  r-i  <-l)i_1U|i_x 

1  =  1  1 


* 


*  ‘ ‘l’ 2  dl  ri(‘1,i‘1"|i-2|+" 


or 


L  L 


IW  =  rolALl  +  *  ,E,  rirj(-1)it:l'lu|i-j 

-  1  =  1  J  1  J 


(III . 12) 


3 


But  the  numerator  in  (III. 11)  is  also  after  multiplying  the  matrices: 

Sum  =  rD|AL|  -  I  .t  | i- j  |  f’1 > i  +  J_1 

3  =  1  i=l  J  1  J  1 
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V 


Resulting  in 


(III. 13) 


as  required. 

We  now  proceed  to  define  an  expression  for  (III.  2)  based  on 

2 

the  CML  estimates  of  A  and  a .  The  expression  for  the  CML  given 
in  (III. 4)  is  now  substituted  in  {III. 2)  for  the  maximum  likelihood 


and  using  (III. 8)  for  the  second  term  in  (III. 2)  and  (N  -  L)  for 
N  we  have : 


CAIC(L)  =  (N  -  L)  ln(27ta 


(III. 14) 


The  factor  a  >  1  is  included  to  account  for  the  asymptotic  nature 
of  the  criterion  and  the  fact  that  (III. 8)  is  a  lower  bound  for 
the  variance  of  .  A  similar  parameter  was  also  suggested  for 
AIC  [16]  and  in  [14]  Akaike  discusses  a  possible  approach  for 
choosing  a.  Since  CAIC(L)  as  given  by  (III. 4)  is  dependent  on 
the  variance  of  x^,  the  test  is  standardized  by  introducing  a 
normalized  innovation  variance  so  that 


CAIC(L)  =  (N  -  L)  In  [o^/ ( var  x^)  ] 
+  (a  -  1 )  L 


(III. 15) 


Thus  CAIC(O)  =  0,  The  factor  a  is  chosen  to  give  more  or  less 
weight  to  the  error  in  the  estimation  of  the  parameters.  In 
other  words,  resolution  can  be  increased  at  the  expense  of  the 
variance  of  the  estimates  by  decreasing  a.  We  have  found, 
empirically,  values  of  3.5-4  to  give  the  most  stable  and  reasonable 
indication  of  the  order. 
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III. 3  Simulation  Results 

We  have  tested  the  performance  of  CAIC  relative  to  AIC  on 
a  number  of  time  series  models  reported  previously.  The  data 
included  normal  as  well  as  uniform  distributions.  The  estimates 
were  based  on  CML  (least-square)  and  Yule-Walker  methods.  In 
the  great  majority  of  cases,  CAIC  performed  as  well  or  superior 
to  AIC.  Examples  of  these  can  be  found  in  [17] .  Some  estimated 
spectra  based  on  orders  determined  by  AIC  and  CAIC  are  also  shown 
in  figures  (III.l)  -  (III. 3).  Yule-Walker  equations  with  auto¬ 
correlation  function  estimates  given  by 


i  n:1 

ri  N  XjXj  +  i 


were  used.  The  example  shown  in  Figure  (III.l)  indicates  the 
relative  stability  of  CAIC.  Figure  (III. 2)  shows  that  the  model 
order  chosen  by  AIC  results  in  spurious  peaks,  while  giving 
higher  peak  resolution  than  the  CAIC  based  one.  Figure  (III.  3) 
shows  that  an  increase  in  white  noise  level  increased  the  AIC 
order  to  the  point  that  spurious  spectral  peaks  became  pronounced 
while  CAIC  remained  nearly  the  same,  showing  the  relative  stability 
of  CAIC. 
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FIGURE  III.  3  Log  power  spectrum  of  xt=sin  (  .  67Tt  +  30 
+n  ;  a  2=.316,  N=100;  CAIC=5 ,  AIC=15 
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IV.  SOME  SUBOPTIMUM  ARMA  SPECTRAL  ESTIMATORS 


As  shown  in  Chapter  II,  the  presence  of  additive  noise  on 
the  observations  of  an  AR  process  implies  that  they  are  an 
ARMA  process.  A  popular  approach  to  circumvent  the  degradation, 
wrought  by  this  model  change,  of  the  AR  spectral  estimate  is 
to  derive  spectral  estimators  based  on  the  ARMA  model.  Box 
and  Jenkins  [  7  ]  among  others,  provide  an  algorithm  to  give  a 
close  approximation  to  the  maximum  likelihood  (ML)  estimate  of 
the  ARMA  parameters,  assuming  Gaussian  observations.  This  is 
considered  the  optimum  estimator  in  view  of  the  desirable 
properties  of  maximum  likelihood  [18].  But  optimum  estimators 
have  computa tional  disadvantages,  described  in  the  next  chapter, 
which  have  compelled  many  researchers  to  consider  alternative 
criteria  that  yield  spectral  estimators  with  greater  computational 
efficiency.  There  is  presently  considerable  activity  in  this 
area,  appropriately  termed  suboptimum  ARMA  spectral  estimation. 

In  this  chapter  a  new  suboptimum  scheme  is  reported  for 
estimating  the  power  spectral  density  (PSD)  of  an  ARMA  process 
of  known  orders.  After  a  preliminary  data  reduction,  this  scheme, 
called  the  least  squares  (LS)  estimator,  minimizes  a  sum  of  squared 
quadratic  functions  of  the  AR  coefficients  using  a  nonlinear  least 
squares  algorithm.  The  poles  of  the  estimated  PSD  are  found  from 
the  minimizing  AR  coefficients,  and  zeros  are  found  from  quadratic 
functions  of  these  coefficients.  Note  that  these  results  have 
been  published  [ 19]  . 

The  general  idea  of  least  squares  fitting  the  ARMA  parameters 
is  not  new,  and  various  other  approaches  have  been  suggested 
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(see,  e.g.,  [7]  and  [20]).  The  scheme  discussed  here  is, 

however,  analogous  to  a  minimum  mean-squared  error  estimation 
of  the  parameters  appearing  in  the  estimator  discussed  in  [21] 
and  [22],  A  modification  of  the  latter  estimator  that  is  based 
on  the  modified  Yule-Walker  equations  (the  MYW  estimator)  is 
also  presented,  in  which  the  problem  of  negative  excursions  of 
the  estimated  PSD  is  corrected  by  tapering  the  estimated  moving- 
average  autocorrelation  function.  Examples  are  shown  that  compare 
the  performance  of  these  ad  hoc  techniques  to  the  approximate 
maximum  likelihood  method  of  3ox  and  Jenkins. 


IV .  1  The  Spectrum  of  an  ARMA  Process 

Assume  that  we  observe  xt,t=l,...,N  where  xt  is  stationary 
and  Gaussian  of  mean  zero,  and  that  xfc  fits  an  ARMA  (L,M)  model. 
Then  we  can  write 


L 

xt  “  aixt~l  =  ut  (IV-X) 


where  a^  are  the  AR  parameters  for  the  ARMA  model  and  ufc  is  the 
MA  residual  sequence  given  by 


u 


t 


M 

Z 

i=l 


bict-i 


(IV. 2) 


where  b^  are  the  MA  parameters  and  is  a  zero-mean  uncorrelated 

2 

normal  sequence  of  variance  a  .  Define 


A  [if  <3^,...,— a^]. 

Then  the  PSD  of  x^_  is  given  by 

Vz>  *  Vz> 

T  —  1  — k 

where  Z,  =  (l,z  ] ,  z  being  the  z-transform  operator 


(IV. 3) 
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,  »  is  the  PSD  of  ut.  Writinq 
z  =  e3w,  and  where  SuU)  Is  tne 

,  -M  .  -Mv,  (IV. 4) 

2rl  +  B  (z_:L  +  +  +  6m^z 

su(z)  =  +  »i<z 

we  can  express  the  PSD  of  ufc  in  terns  of  its  v«l»nCe 

»  normalized  autocorrelation  function  coefficie  l- 

normality  of  nt  gives  ns  «  conditional  — " 

218-225] 


(IV. 5) 


E(utlut.i) 


9iut-i 


,  u  the  best  linear  prediotor  o£  ut  dived 
and,  in  general,  Si  is  tn 

mpan-square  error  (mmse)  sense. 

..  in  the  minimum  mean  squai 

t-i 

-nr  *5  LS  Es t iraa to ^ 

- - - -  ■  _  o£  e  is  obtained  by  minimize 

A  least  squares  estimate  of  ^ 

given  by 
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N 
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lu. 


t=L+i+l 

N 

t=L+i+l 


^iut-i') 


(ATX[t/t.L]  ~ 


where 


xTli.il  =  ^xi,xi“l . Xi’'  1  ’ 

The  derivative  with  respect  to  »t  vanishes  for 


ATRiA 


(IV. 6) 


where 
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N  T 

Ri  =  t=L+i+l  (t-i , t-L-i ] 


Roi  t=L+i+l  [t-i , t-L-i ] 1  * 


That  minimizes  is  seen  by  taking  the  second  derivative 


2ATR  -A 

Ol 


N 

2  I 
t-L+i+1 


(A  X  f ,  .  ,  .  T1 

[  t- 1  ,  t- 1-  L] 


>  0  . 


(IV. 7) 


This  also  establishes  a  nonzero  denominator  for  0^  in  (IV. 6). 

To  get  an  estimate  of  the  variance  of  ufc  we  use  the  sample 
moment 


(N-L)  a  = 


N 

I 

t=L+l 


u,.  = 


N 

I 

t=L+l 


T  n 

A  X[t,t-L]X 


[  t ,  t-L] ' 


T 

A  R  A 
oo 


(IV. 8) 


where  R  is  qiven  in 
oo  ^ 

The  fact  that  ut 
one  method  to  fit  the 
equations  (3^  =  0  ,  i  = 
that  the  denominators 
simpler  system 

ATRiA  =  0, 


(IV. 6)  . 

is  MA (M)  implies  that  t 
vector  A  to  the  data  is 
M  +  1 ,  ...  , M  +  L  from 

are  nonzero,  we  can  ins 

i  =  M  +  1 ,  ...  , M  +  L. 


^  =  0  for  i  >  M,  so 
to  solve  the  set  of 
(IV. 6) .  Recognizing 
tead  solve  the 

(IV. 9) 


The  locus  of  solutions  for  each  of  the  above  equations  is  a  quadric 
surface  [24,  pp.  287-294],  and  there  is  no  solution  to  the  system 
if  the  surfaces  do  not  jointly  intersect  at  least  one  point.  In 
order  to  obtain  a  value  for  A  whether  or  not  there  is  a  common 
intersection,  we  choose  A  to  minimize  Q  given  by 
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i=M+l 


(ATRiA) 2. 


(IV. 10) 


The  value  of  Q  is  zero  at  any  point  of  intersection  for  all  the 
surfaces  in  (IV. 9).  The  vector  A  obtained  via  minimization  of  Q 
will  henceforth  be  referred  to  as  the  LS  estimate.  Using  the 
LS  estimate,  we  next  find  6^,  i=l,  ...  ,M  using  (IV. 6)  and  ou 
using  (IV. 8)  and  substitute  these  values  in  (IV. 4)  and  (IV. 3)  to 
get  an  estimate  of  the  PSD  of  xt,  henceforth  termed  the  LS 
spectral  es timate . 

IV. 3  The  MYW  Spectral  Estimator 

The  rational  spectral  estimator  in  [  ]  (the  MYW  estimator) 

produces  A  to  solve  the  modified  Yule-Walker  equations,  which 
for  an  ARMA  (L,M)  process  are 


cm+ia  -  0 


(iv. ii ; 


where 


C.  =  c.  . 
k  13 


cx(k+i-j) 


i  1,  ...  ,  L  ?  3  If  ...  ,  L*t*  1 , 

and  where  c  (i)  is  an  estimate  of  the  ith  lag  autocorrelation  of 
x  ,  for  example, 


Cx(i>  N  t£1  XtXt+i ' 


(IV. 12) 


Then  estimates  of  the  autocorrelation  function  r  (i)  of  u^  are 

u  t 

found  by 


ru(i)  -  A‘C.A.  i>0,l . H 


(IV. 13) 
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and  the  estimated  PSD  is 

S  (z)  =  [r  (0 )  +  r„ (1) (z-1+z)  + . . . +  r„ (M) (z~M  +  zM) ]  •  ATZ  ~2. 

X  u  U  U  Li 

(IV. 14) 

Note  that  the  LS  and  MYW  estimates  of  A  are  asymptotically 
equivalent.  To  see  this,  observe  that 


p  lim(N-L-k 

N-^°° 


+ 


A 


T 

k 


(denotes  convergence  in  probability)  [25]  where  incorporates 
the  mth  lag  autocorrelation  (m)  of  according  to 


rx(k+i~j) , 

i  1,  ...  ,  L+l ,  ^  '’l  /  ...  ,  L+l  , 


and  that  p  limN_)_ooCk  =  j  ]  j  |  j  ,  i  =  l,  ...  ,L;  j  =  l,  ...  ,L+1.  The 

k 

solution  to  ( IV .11)  obtained  by  replacing  by  its  limiting  value 

also  solves  (IV. 9)  with  (N-L-k)  replaced  bv  its  limiting  value. 

There  is  one  other  solution  to  this  asymptotic  form  of  (IV. 9), 
but  we  conjecture  that  it  is  outside  the  stationary  region  for  A. 

The  spectral  estimate  given  by  (IV. 14)  is  not  guaranteed  to 
be  a  nonnegative  function  of  z  =  e^71^  for  fe [0,1/2].  For  instance, 
if  observations  consist  of  two  additive  narrow-band  AR(2)  signals 
having  center  frequencies  in  close  proximity,  the  true  z-spectrum 


consists  of  two  closely  spaced  poles  just  inside  the  unit  circle 
and  a  zero  just  inside  the  unit  circle  and  between  the  poles  in 
frequency.  An  error  in  estimating  the  numerator  polynomial  in 
(IV. 14)  can  cause  what  should  be  a  near-zero  positive  value  at 
the  bottom  of  the  trough  in  the  frequency  response  of  the  numerator 
to  be  a  near-zero  negative  value,  thus  making  the  PSD  estimate 
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negative  in  a  region  near  the  peak.  An  effective  counter  is 
to  reduce  the  depth  of  the  trough  by  multiplying  the  r^(i)  by  a 
taper  which  slightly  reduces  the  frequency  resolution  of  the 
estimated  numerator.  Use  of  the  linear  taper 

=  1  -  i/K ,  i=0 , 1 ,  ...  ,M,  K  >  M  (IV. 15) 

has  been  successful  in  eliminating  negative  excursions  of  the 
PSD  estimate  when  used  to  produce  ru(i)t  according  to 

ru(i)  =  ^’iru  ( 1 )  /  1=0  ,  .  .  .  ,  M . 

Another  strategy,  used  in  [20],  is  to  replace  the  numerator  in 
(IV. 14)  by  the  periodogram  of  ufc .  For  short  data  records,  however, 
the  periodogram  often  cannot  adequately  represent  the  movinq 
average  spectrum,  resulting  in  inaccurate  indication  of  the  power 
under  the  peaks  of  the  ARMA  spectrum.  In  tests  of  the  MYW 
estimator,  all  negative  excursions  were  eliminated  for  K  >>  M. 

No  negative  excursions  were  noted  for  the  LS  algorithm,  but  the 
existence  of  a  guarantee  has  not  been  investigated.  It  is 
emphasized  that  the  use  of  the  numerator  taper  does  not  sacrifice 
the  resolution  of  the  ARMA  spectral  estimates.  This  is  due  to 
the  fact  that  resolution  is  mainly  determined  by  the  poles  and 
the  MA  spectra  of  interest  are  relatively  smooth. 

IV. 4  Simulation  Results 

Figure  (IV. 1)  shows  LS  estimates  of  the  PSD  of  20  realizations 
of  xt  =  wt  +  Yt  +  0.5  nt  where  wfc  =  0.4  -  0.93  wt_2  +  e  and 

Yt  =  -0.5  -  0.93  yt_2  +  nt  and  where  ,  nt,  and  nfc  are 

mutually  independent  i.i.d.  Gaussian  of  mean  zero  and  unit  variance. 
Q  in  (IV. 10)  is  minimized  using  the  conjugate  gradient  technique  [26] 


with  A  starting  at  the  origin.  The  combined  effect  of  the  plots 
is  to  suggest  a  bias  in  the  estimator  which  smears  the  spectra. 

In  Fig.  (IV. 2)  the  tapered  MYW  estimate  is  depicted  for  the  same 
set  of  realizations.  K  in  (IV. 15)  is  set  to  the  minimum  value 
that  succeeds  in  eliminating  all  negative  excursions  of  the 
spectral  estimate.  In  this  set,  the  untapered  MYW  estimates  of 
seven  realizations  went  negative,  and  K  =  34 .  Figure  (IV. 3)  shows 
the  unconditional  least  squares  Box  and  Jenkins  estimate  for  the 
same  set  of  realizations.  It  is  apparent  that  the  performance 
of  the  MYW  estimator  is  nearly  as  good  as  that  of  the  Box  and 
Jenkins  estimator  for  most  realizations.  The  disturbing  tendency 
of  the  MYW  estimator  to  produce  an  occasional  aberrant  estimate 
is  also  exhibited  in  the  figure. 

The  simulations  provide  evidence  that  the  computational 
simplification,  compared  to  the  optimum  schemes,  provided  by 
data  reduction  in  the  LS  estimator  is  accomplished  at  the  expense 
of  a  substantial  tradeoff  in  statistical  efficiency.  The  results 
also  suggest  that  the  MYW  estimator  has  higher  statistical  efficiency 
than  the  LS  estimates.  These  observations,  coupled  with  the  high 
computational  efficiency  of  the  former,  lead  us  to  conclude  that 
the  MYW  estimator  is  the  superior  of  the  two  suboptimum  spectral  . 
estimators  considered . 
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Fig.  IV. 1.  LS  spectral  estimate,  N  =  125. 
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V.  STATISTICAL  CLASSIFICATION  OF  SOME  ARMA  SPECTRAL  ESTIMATORS 


Many  of  the  spectral  estimates  appearing  in  the  engineering 
and  statistics  literature  over  the  last  two  decades  fall  into 
one  of  two  classes.  In  the  minimal  sufficient  (MS)  class,  the 
AR  and  MA  parameters  are  adjusted  simultaneously  to  give  a  least 
squares  or  approximately  least  squares  fit  to  the  observed  data 
vector.  The  name  "minimal  sufficient"  acknowledges  that  no 
reduction  of  the  data  is  a  sufficient  statistic  for  stationary 
Gaussian  time  series  having  zeros  in  the  power  spectral  density 
(PSD)  [31].  The  resulting  estimates  are  approximately  maximum 
likelihood  (ML)  for  Gaussian  data.  In  the  sequel,  it  is  assumed 
that  the  data  is  a  Gaussian  time  series,  and  the  ML  ARMA 
parameter  estimate  is  referred  to  as  optimum,  for  reasons 
discussed  in  Chapter  IV.  Thus,  optimum  solutions  are  contained 
in  the  MS  class.  The  equations  for  the  least  squares  solution 
are  highly  nonlinear,  and  thus  estimators  in  this  class  are 
typified  by  nonlinear  optimization  algorithms  and  other  iterative 
approaches  and  their  concomitant  pitfalls,  i.e.,  nonconvergence 
or  convergence  to  local  rather  than  global  extrema,  the  need  for 
preliminary  ARMA  parameter  estimates  to  start  the  iterations, 
and  large  computer  memory  and  time  requirements.  The  auto¬ 
correlation  function  ^iass  is  composed  of  estimators  which 

utilize  a  fixed,  finite  number  of  lags  of  the  sample  ACF .  The 
estimation  equations  are  usually  linear  or  quadratic,  and  often 
the  AR  parameters  are  estimated  alone,  after  which  estimates 
either  of  the  MA  parameters  or  of  some  function  related  to  the 
PSD  of  the  MA  residuals  are  computed  using  the  AR  estimates.  All 


of  the  estimators  in  this  class  are  suboptimum,  the  truncated 
sample  ACF  being  a  data  reduction.  Further,  an  optimum  fitting 
requires  that  all  parameters  be  adjusted  simultaneously. 
Suboptimality,  then,  is  the  price  paid  for  the  gains  in 
computational  simplification  and  relaxed  memory  requirements 
of  estimators  in  this  class. 

In  its  emphasis  on  computational  efficiency,  the  literature 
has  left  largely  unanswered  questions  regarding  the  statistical 
efficiency  of  estimators  in  the  ACF  class.  We  provide  in  this 
chapter  an  evaluation  of  Fisher's  information  matrix  for  the 
truncated  sample  ACF  of  ARMA  processes.  Only  the  asymptotic 
(infinite  observation  record,  i.e.,  N  -*  °°)  case  is  treated  to 
make  the  analysis  tractable,  and  the  magnitude  and  angle  of  poles 
and  zeros  of  the  PSD  are  taken  as  parameters  of  interest.  The 
results  will  yield  asymptotic  bounds  on  the  statistical  efficiency 
of  any  estimator  in  the  ACF  class. 

V.l  Classification  of  Some  ARMA  Spectral  Estimators 

In  keeping  with  the  goal  of  obtaining  asymptotic  results, 
the  estimators  to  be  discussed  are  classified  according  to  the 
limiting  form  that  the  statistic  on  which  they  are  based  takes 
as  N  approaches  infinity.  The  MS  class  includes  the  methods  of 
Tretter  and  steiglitz  [32],  Hannan  [33],  Akaike  [34],  Konvalinka 
and  Matousek  [35],  and  Box  and  Jenkins  [7,  pp.  231-235].  The 
methods  of  Walker  [36],  Hsia  and  Landgrebe  [37],  Graupe , 

Krause  and  Moore  [38],  Sakai  and  Arase  [39],  Satorius  and  Alexander 
[40],  Kavoh  [?1],  Kinkel  et  al.  [22],  and  Bruzzone  and  Kaveh  [19] 
are  induced  in  the  ACF  class.  Cadzow's  method  [20]  is  based  on 
a  reduction  of  nearly  all  N  lags  of  the  sample  ACF  and  hence  does 
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not  fit  into  either  class.  We  have  observed  a  tendency  toward 
large  frequency  errors  in  Cadzow's  method  as  well  as  an 
exaggeration  of  the  sharpness  of  spectral  peaks  while  requiring 
nearly  as  much  computer  time  and  memory  as  methods  in  the  MS 
class.  These  facts,  in  conjunction  with  the  suboptima  1 i ty  of 
the  method  as  a  result  of  the  use  of  a  data  reduction  as  well 
as  non-simultaneous  determination  of  the  AR  and  MA  parameters, 
lead  us  to  dismiss  Cadzow's  method  from  further  analysis.  A 
number  of  estimators,  e.g.,  [36],  which  are  based  on  the  sample 

ACF  yet  claim  to  be  asymptoti cal ly  efficient  from  a  subclass  of 
the  ACF  class.  There  is  no  discrepancy  here,  however,  in  that 
efficiency  obtains  in  the  limit  as  first  the  number  of  observations, 
then  the  number  of  sample  ACF  lags,  approach  infinity.  In 
practice  we  are  not  at  liberty  to  extend  the  estimators  in  such 
fashion,  so  these  estimators  are  analyzed  assuming  a  finite 
truncation  of  the  sample  ACF. 

Looking  more  closely  at  the  ACF  class,  we  see  that  [37], 

[21],  [22],  [40]  and  [19]  first  compute  an  estimate  of  the  AR 

parameters,  and  then  use  these  to  estimate  some  function  related 
to  the  MA  parameters.  Thus,  the  statistical  efficiency  of  these 
estimators  is  determined  largely  by  the  efficiency  of  the  AR 
parameter  estimates  in  the  first  stage.  The  AR  parameters 
determine  the  pole  locations  of  the  ARMA  PSD,  and  consequently 
they  have  far  greater  influence  on  its  shape  than  do  the  MA 
parameters  in  the  problem  of  high  resolution  spectra.  It  is 
clear  that  the  asymptotic  efficiency  with  respect  to  only  the 
estimated  pole  locations  of  various  estimators  is  a  meaningful 
criterion  by  which  to  judge  them.  For  example,  [37],  [21],  [22], 


[40]  and  any  method  based  on  modifying  the  numerator  portion 
of  their  estimated  PSD’s  (as  in  Kay  [41])  are  equivalent  by 
this  criterion,  a  result  that  agrees  with  our  experience. 

The  two  popular  sample  ACF's 

1  ^ 

°xU)  =  N  xtxt+i  ;  1  =  0'1 . k 

and  (V . 1 ) 


rx(i: 


1 

N-i 


N-i 

v 

t=l 


xtxt+i 


i=0 , 1 ,  .  .  .  ,k 


are  asymptotically  equivalent  for  finite  k.  We  take  as  the 
statistic  of  interest  C^k  k^  =  [cx  Oc^)  ,cx  (k^+1)  ,  .  .  .  ,cx  (k2)  ]  , 

k^  <  ^2*  This  allows  us  to  take  into  account  the  effect  of  not 
using  low-lag  ACF  values,  as  is  the  case  in  the  five  estimators 


listed  in  the  previous  paragraph,  wherein  cx(o)  is  not  used  to 
estimate  the  AR  parameters.  The  asymptotic  Fisher’s  information 
matrix  is  derived  for  any  subset  of  the  poles  and  zeros  by 
appropriate  choice  of  the  parameter  vector. 

We  mention  briefly  the  work  of  Gersch  [42],  who  provides 
the  asymptotic  covariance  matrix  of  the  AR  parameter  estimates 
gotten  from  the  modified  Yule-Walker  operations,  and  that  of 
Sakai  and  Tokumaru  [47],  who  give  the  covariance  of  the  estimated 
power  spectrum  gotten  from  using  [21]  or  [22]  (these  are 
equivalent) .  Little  else  has  been  done  to  analyze  the  suboptimum 
ARMA  spectral  estimators,  and  while  these  results  are  useful, 
they  lack  the  general  applicability  of  the  analysis  we  now  undertake. 
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V.2  Evaluation  of  Fisher’s  Information  Matrix 

We  begin  by  evaluating  the  asymptotic  covariance  matrix  of 
the  sample  ACF  for  an  ARMA(L,M)  process,  where  we  assume  L  >  0, 
using  the  notation  and  assumptions  of  Chapter  IV*  It  is  known 
that  {12,  p.  181],  for  a  stationary  time  series  xfc , 

1  °° 

<Um  cov  [c  (k)  ,c  ( l)  ]  =  rr  I  y  (ijy  (i+5L«ki (l  +  JDy  (i-k)  (V.2) 

X  X  N  i=s-oo 


where  y(i)  =  Efx^x^.^).  The  ACF  of  an  ARMA ( L, M)  process  is 
recursive  beginning  with  lag  M«  Starting  values  are  given  by 
the  appropriate  system  below  (see  {44]), 
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where 


Di  *  V  .  "  ,  Y  (a  -bt)B  detla  )  ♦  oc\,  (V.4) 
3=1+1  z  =  l  J  J 

with  Bq  =  1,  for  B^  =  0  for  i  >  M,  =  0  for 


i  >  L,  and  Z  =  0  for  j  <  i.  Also, 
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ak  = 


a,  a. 
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1  1 


;  k  >  1 


(V.  5) 


;  k  =  0 


where  a.  =  0  for  i  >  L.  For  M  >  1  and  L  >  1  we  extend  the  vector 

of  starting  values  to  lag  M+L-l  using  the  AR  recursion 

% 


Y (k)  =  Z  a. y (k-i) . 
i=l  1 


( V  .  6  ) 


Although  this  recursion  can  be  used  to  extend  the  ACF  arbitrarily 
far,  the  infinite  sums  in  (V. 2)  are  simplified  by  expressing  the 
ACF  in  terms  of  its  poles,  i.e., 


Y(i) 


j  =  l 


i-M 

IjGj  '  ,  l  >  M  , 


( V .  7  ) 


where  the  are  zeros  of  the  characteristic  equation 


1  -  Z  a .  z  =  0  , 
i=l  1 


(V.8) 
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and  Aj  is  the  residue  of  G  j  .  Usinq  the  extended  ACF ,  we  find 
the  A  j  by  solving 


1  ...  1 


Y (M+l ) 


(V .  9 ) 


G.L  1  G0L_1  ...  GtL  1  XT  Y  (M+L-l) 

l  1  £  L  L 


The  infinite  sums  in  (V.2)  can  be  expressed  in  terms  of  one-sided 


infinite  sums,  as 
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3  evei 


and  the  infinite  sums  can  be  separated  into  terms  of  the  ACF 
directly  influenced  bv  the  MA  parameters  and  those  followina  the 
recursive  form  (V.7),  as 
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where  the  assumption  that  all  poles  are  inside  the  unit  circle 
allows  the  interchange  of  summation  and  guarantees  convergence 
to  the  closed  form  result. 

Substituting  (V.ll)  into  (V.10),  we  have 
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Finally,  we  note  that  (V.2)  can  be  rewritten 


iim  cov[cx(k) ,c  U) ]  =  ^  I  y (i) y (i+fc-k) +y (i) y (i-£-k) ,  (V.13) 
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where  terms  used  in  the  finite  sums  are  obtained  using  (V.6). 
Note  that  the  above  assumes  £  >  k ,  £  >  0  and  k  >  0.  This 
presents  no  problem  in  evaluating  the  covariance  matrix,  for 
c  (k)  =  c  (-k)  implies  that  negative  values  of  £  and  k  are  not 
needed,  and  the  toeplitz  form  of  the  covariance  matrix  implies 
that  cov  [c  (k)  ,c  U)  ]  =  cov  [c  ( H)  ,c  (k )  ]  .  The  case  L  =  0  is 

A  A  XX 

handled  by  (V.3b)  with  (V.4)  and  (V.5).  (V.10)  is  modified  to 

M 

2  Z  y2(i)  +  Y2(0)  ;  j  -  0 

i  =  l 


00 

Z  Y  (i)  Y  (i+j  )  = 

i=-oo 


f 


(i=0 


j-1 

2 

Y  ( i )  Y  (i  + j  )  +  Z  i)Y(j-i) 

i=l  j 


odd 


I  -i 

M  2 

I  Y  (i  )  Y  ( i  + j  )  +  Z 
i=0  i=l 


Y (i) Y ( j— i ] 


+>2(j)  t 


j 


(V .  15 


even . 


Then,  using  (V.2) 


we  have ,  for  L  =  0 , 
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The  probability  density  function  (p.d.f.)  of  the  sample  ACF 
of  an  ARMA  process  is  asymptotically  multivariate  Gaussian  [  ] 

If  we  denote 


r[k  =  [Y(k1)/Y(k1+l)^../y(k2)]  ;  k2  >  k1 


T 

A[kl'k2]  =  E(C[kl'k21”r[kl'k2]) (C[kl'k2]"r[kl'k2]  '  (V-17) 


then  the  asymptotic  p.d.f.  of  Cr,  .  ,  is  given  by 
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where  G  represents  the  vector  of  poles  and  P  the  vector  of  zeros. 
Fisher's  information  matrix  is  given  by  [45]  (denote  lim  f  =  f  ) 
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We  take  rJQ  =  ^(G^l,...,  i<^Ly2‘'^>l''’’,^L/2,  ^l  ,*,*,  ^M/2'':*;l,***f^M/2], 
where  the  assumption  that  poles  and  zeros  occur  in  complex  conjugate 
pairs  halves  the  size  of  the  parameter  vector.  The  are  angles  of 
the  pole  pairs,  and  zeros  solve 
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We  have  been  unable  to  obtain  the  derivatives  in  (V.19) 
due  to  the  complicated  nature  of  the  p.d.f.,  and,  even  if  they 
were  available,  it  is  doubtful  that  the  expectation  would  yield 
to  a  concise  solution.  Numerical  methods  are  not  well  developed 
for  multiple  integrals,  and  these  have  the  added  complication 
of  being  improper,  so  we  consider  Monte  Carlo  methods.  The 
most  basic  approach  is  to  generate  several  thousand  points 


(Cr.  I,  i)*  distributed  according  to  (V.18)  and  to  approximate 
l K  i  /  K  2  *  1 

the  expectation  by  a  sample  mean 
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where  -  is  the  number  of  points  generated.  The  matrix  of  second 
partials  is  computed  using  standard  numerical  routines,  i.e., 
we  make  small  perturbations  in  9Q.  For  each  new  0  ,  we  recompute 
poles  G^,i=l,...,L  and  zeros  ,  i=l , . . . , M ,  which  occur  in  conjugate 
pairs.  Then,  we  obtain  the  corresponding  vectors  A  and  B  by  solving 

(1-G1z“1) (1-G2z_1) . . . (1-Glz_1)  =  l-a1z_1-...-aLz"L 


(V. 22’ 
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from  which  we  proceed  to  the  calculation  of  y(i)  and  eventually 
A k  ]  m  The  sum  in  (v-21)  is  evaluated  for  the  several  perturba¬ 
tions  of  9  needed  in  numerical  evaluation  of  the  matrix  of  second 


partials,  using  the  same  set  {(Cf,  . 

*  1 '  z  1 


).}  each  time.  If  we  are 


interested  in  the  information  with  respect  only  to  poles,  we 
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use  the  reduced  parameter  vector  8 
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We  then  estimate  the  asymptotic  Cramer-Rao  bound  on  the  error 
covariance  matrix  for  8o  (note  that  we  consider  only  the  class  of 
asymptotically  unbiased  estimators)  as 


a  /v  m  i  /v  l 

(VV(Vso>  ^  I  <0O) 


(V. 23] 


where  the  matrix  ordering  is 


T  T 

R>Q'*yRy>yQy 


for  R  and  Q  square  of  dimension  (say)  K  and  v  any  vector  of 
dimension  K. 

Questions  regarding  the  details  of  the  Monte  Carlo  simulation 

are  under  investigation.  Next,  we  will  consider  the  special 

2 

case  of  AR  time  series  in  white  noise  of  variance  a  .  The 

n 

effect  of  this  noise  on  the  information  in  Cri  .  ,  is  evaluated 

[k1  ,k2l 

by  replacing  y(0)  obtained  from  (V.3a)  by  y*(0)  =  y(0)  +  . 

This  will  give  a  quantitative  measure  of  the  effect  of  the  model- 
change  phenomenon  in  the  presence  of  noise.  Studies  of  more  general 
time  series  and  noise  will  follow. 


VI.  SPECTRAL  ESTIMATION  FOR  NOISY  COMPLEX  SINUSOIDS 


High  resolution  spectral  estimation  has  found  special 
popularity  in  applications  involving  real  or  complex  sinusoidal 
signals.  Examples  of  these  applications  include  radar  doppler 
processing,  and  radar  or  other  sensor  array  processing  for 
improved  angular  resolution  [27].  A  method  that  has  found 
special  appeal  in  these  applications  is  the  maximum  entropy 
method  of  spectral  analysis  (MEM)  introduced  by  Burg  [  8  ] . 

However,  there  have  been  several  disturbing  problems  with  this 
method  when  applied  to  sinusoids,  notably,  frequency  errors 
depending  on  the  sinusoidal  components'  phases  and  line  splitting 
under  high  order  estimates  and  large  signal- to-noise  ratios. 

We  have  considered  the  frequency  error  problem  and  have 
investigated  a  modification  to  Burg's  original  alaorithm, 
which  we  denote  the  tapered  Burg  algorithm.  The  tapered  Burg 
technique  is  a  direct  result  of  considering  a  weighted  least- 
squares  fit  to  the  parameters  of  the  all-pole  model,  subject 
to  Levinson's  recursion  constraint,  in  place  of  the  usual 
unweighted  least-squares  fit.  The  algorithmic  consequence 
of  this  approach  is  the  inclusion  of  an  appropriate  taper 
in  the  calculation  of  the  partial-correla tion  coefficients  in 
the  usual  Burg  algorithm.  Based  on  the  expression  for  the 
frequency  error  in  the  spectral  estimate  of  a  sinusoid,  an 
optimum  taper  is  derived.  The  performance  of  this  optimum  taper 
is  then  compared  with  those  of  the  rectangular  (untapered  Burg) 
and  Hamming  tapers.  It  appears  that  this  taper  makes  MEM  spectral 
estimates  of  sinusoids  using  Burg's  technique  more  robust,  without 
sacrificing  its  resolution. 
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In  the  popular  derivation  the  MEM  coefficients  define  a 
predication  error  filter  (PEF)  and  are  chosen  so  as  to  minimize 
the  average  of  the  forward  and  backward  prediction  residual 
energies  subject  to  Levinson’s  recursion  constraint.  In  [  8  ] 

Burg  generalized  this  derivation  by  minimizing  a  weighted 
average  of  the  residual  energies.  This  was  presumably  done  to 
allow  the  analyst  to  reflect  his  or  her  confidence  in  possibly 
disjoint  data  records.  For  contiguous  data,  it  did  not  appear 
that  any  weighting  of  the  average  residuals  was  needed. 

Therefore,  the  popular  MEM  spectral  estimate  has  been  that  of 
Burg's  original  suggestion  of  using  a  rectangular  (uniform) 
taper.  Recently,  Swingler  showed  [28],  through  numerical 
simulations,  that  a  reduction  in  the  error  in  the  estimated 
frequency  of  a  real  sinusoid  is  obtained  if  a  Hamming  taper  is 
employed  in  the  calculation  of  Burg's  partial-correlation 
(PARCOR)  estimates.  This  method  of  tapering  is  exactly  that 
reported  in  [  8  ]  by  Burg. 

In  this  chapter,  we  first  derive  an  expression  for  the 
estimated  frequency  error  of  a  real  sinusoid  using  Burg's  tapered 
method.  This  is  a  generalization  of  the  error  expression  for  the 
untapered  Burg's  technique  reported  by  Swingler  [29].  We 
subsequently  -use  the  error  expression  to  derive  an  "optimum" 
taper.  Finally,  simulation  results  comparing  the  optimum, 

Hamming  and  rectangular  tapers  in  obtaining  Burg  spectral  estimates 
of  complex  sinusoids  in  noise  are  given. 

VI. 1  Generalized  Error  Expression 

In  this  section  we  derive  an  expression  for  frequency  error 
of  sinusoid  based  on  the  tapered  Burg  technique  for  a  General 
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weighting  function.  Let  the  taper  wm(t)  be  defined  as  a 
function  of  the  continuous  variable  t  for  |t|  <  and  take 

on  the  value  zero  elsewhere.  Let  the  discrete  time  version  be 


N-m. 

wmk  =  Vk  -  ir*  • 


(VI.l) 


As  in  (6)  we  assume  that  the  window  is  normalized  and  non-negative: 


N-m 

E  w  .  =1  ,  all  m  , 

k=0  ^ 


Wmk  -  0 


,  all  m,k  . 


The  non-negativity  insures  that  the  magnitude  of  the  partial 
correlation  coefficients  (PARCORs)  do  not  exceed  unity. 

For  the  data  record  ,k=0 , 1 , . . . , N ,  the  nth  PARCOR  is  given 
in  this  tapered  Burg  method  as  [  8  ] 


mm 


-2 


N-m 

k=0  WnikDmkEmk 

N-m  I  I 

Z  w  .  (D  ,  +  E  ,  ) 

^=0  ^  mk  mk 


(VI. 2) 


where  and  Emlr  are  given  in  (VI.  6). 


"mk 


For  =  cos  (0k  +  4)),  the  first  PARCOR  is 


lll 


cos  (  0  )  + 


2  N-l 

sin  (9)  Z  w,. cos  (20k  +  0+2<J>) 
k=0  iK 


N-l 

l+cos(0)  E  w1 ,  cos  (  20k  +  6+2<J>) 
k=0  iK 


(VI. 3) 


We  choose  to  express  the  first  PARCOR  as 


a^1  =  -cos  (9  +  5) 

=*  - [cos ( 0) +6sin  {  0) ) 


for  5  <<  1  . 


(VI. 4) 
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From  (VI. 3)  we  identify  5  as 


N-l 

E  w.-cos  (20k  +  0  +  2<f>) 

k=Q  iK _ 

N-l 

l+cos(6)  E  w-, .  cos  ( 20k  +  0+2(J> ) 
k=0 


(VI. 5) 


To  look  at  the  conditions  which  make  6  small  we  further  assume 
that  the  window  w^(t)  is  even  and  thus  its  Fourier  transform 
W^(oi)  is  real.  The  summation  in  (VI.  5)  then  becomes 


N-l 

E  w. ,  cos (20k  +  0+20)  = 
k=0 

(VI. 6) 

CO 

cos  (N0+2$ )  E  (-l)n(N-1)W1(2^n  +  2ij) 

n=-co 

Since  W,  (o)  is  bandlimited  on  the  order  of  r/s  we  can  approximate 

the  summation  when  |  9  (  <  tt  and  N  >>  1  as 


N-l 

Z  w,,  cos  (20k  +  0  +  2<M 
k=0  1K 


cos (N0+2$) Wx (20) 


<  TT 


(VI .7) 


From  (VI. 6)  and  the  constraints  on  w^  we  have  that  W^(0 
Thus  we  see  that  the  assumption  5  <<  1  is  valid  when  |0j 
and  N  >>  1. 

The  second  PARCOR  is  exactly  given  by 


< 


1. 


a2  2 


A 

B 


N-2 

Z 

k=0 

N-2 

Z 

k=0 


W2j,ccos(2  0k*f20-f24>) 


W2^cos  (20k+20  +  24>) 

j 


(VI. 8) 


where 
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2 

A  -  +  2a^cos(0)  +  cos  (20) 

2 

B  =  a^  +  2a^cos(9)  +  1 


Using  the  approximation  for  a^  results  in 

A  =  -  ( 1- 6 2 ) sin  (0)  ,  B  =  ( 1-52) sin (0)  , 

and  thus 

a22  ”  1  ^  <<:  ^  • 

The  Levinson  recursion  gives  a^^  =  a^1  +  a22all  -  ^all  anc^  the 
resulting  PEF  is  then  (1,  -2  cos  (0-6)  ,  1),  Comparing  this  to 
the  ideal  PEF  (1,  -2  cos  (0)  ,  1),  we  identify  the  frequency 
error  as 


N-i 

Z  w,,  cos  (20k  +  0+24>) 

1  k=0 

=  2^  sin  ( 0 )  — - - -  (VI. 9) 

1+cos  (0)  Z  w, ,  cos  (2Ok  +  0  +  2(J>) 
k  =  0  iK 


which  may  be  approximated  by 


1 


N-  1 


f  -  sin(9)  Z  w, ,  cos  (  20k  +  0  +  24> )  , 
k=0  ik 

TT  Ini  7T 

FFI  l0l  <  71  _  fFl 


-  sin (0) cos (N6+2$ ) WI (20) 


(VI 10  (a)  ) 


(VI. 10 (b) ) 


For  uniform  weighting,  ^  ,  we  have 


N-l 

i-  wlkcos(20k  +  9  +  2^)  =  cos  (N0  +  24>) 


and  (10a)  reduces  to  Swingler's  expression  [29]. 
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VI .2  An  Optimum  Taper 

If  we  consider  the  phase  4>  as  a  random  variable  uniformly 
distributed  on  [-tt,7t)  then  the  mean  value  of  the  frequency  error 
is  zero,  using  (10a) .  The  variance  of  this  frequency  error  is  then 


1  ~  N-l  N-l 

var(Af)  =  — sin^(0)  I  Z  w, -w,  Ocos  ( 29  (k-  l)  ) 
8  tt  2  k=0  i=0  11 


(VI. 11) 


As  a  criterion  for  selecting  a  taper  we  use  the  average  frequency 


error  variance: 


<var(Af)>  =  —  var(Af)d6 


N-l  N-l 


8  tt  k  =  0 


wlkwl i  [2  Sk-i  4  6l-k+£  4  5l+k-$J  ' 


(VI .12) 


where  the  subscripted  delta  is  the  digital  (Kronecker)  impulse. 

With  the  normalization  constraint  introduced  using  the  Laarange 
multiplier  X  the  resulting  optimum  taper  is  given  by  the  solution  of 


c  WL  =  X  ,  cu  =  2^cij 


-1  i  j  =  i±l 
0  ,  otherwise 


'  VA 


(VI. 13) 


The  general  system  of  equations  with  a  tri-diagonal  coefficient 
matrix  has  a  known  recursive  solution  related  to  the  LU  decomposition 
of  the  coefficient  matrix  (see  for  example  [30]).  For  the  system 
of  equations  with  the  special  coefficient  matrix  in  (VI. 13),  we 
have  derived  closed  form  expressions  for  the  taper  coefficients 
(Appendix  B).  These  are  given  for  the  m-th  order  PARCOR  by: 


6  (k  +  1)  (N-m-k  +  1 ) _ 

;mk  (N-m+1)  (N-m+2)  (N-m+3"! 


,  k=0 ,  .  .  .  , N-m 


(VI . 14) 
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Thus,  this  optimum  taper  is  parabolic  in  form,  it  is  even, 
positive  and  has  a  maximum  at  k  =  ^  ™ —  .  Furthermore,  knowinq 
wmQ  and  from  (VI. 14),  one  can  generate  the  remaining 

coefficients  recursively  from: 


w  i  —  2w  n  x  -  w  ,,  o  \  ~ 
mk  m(k-l)  m(k-2) 


(VI . 15) 


VI .  3  Simulation  Results 

The  effect  of  tapering  was  numerically  investigated  by 
generating  spatial  samples  of  complex  sinusoids  in  noise  for 
various,  phase  and  angle  (frequency)  combinations.  For  brevity 
of  space,  however,  only  the  results  of  a  very  few  representative 
simulations  are  included  in  this  report.  The  method  designations 
on  the  plots  are:  Burg  for  untapered  Burg  method,  WBurgH  and 
WBurgO  for  tapered  Burg  technique  using  the  Hamming  taper  and 
the  optimum  taper  respectively.  The  Hamminq  taper  is  used  for 
comparison  with  the  simulations  in  [28], 

Figure  (VI. 1)  shows  three  tapered  Burg  spectral  estimates 

of  two  complex  sinusoids  at  -30°  and  -45°  off  broadside  with  zero 

phases  each,  in  complex  white  noise.  The  signal- to-noise  ratio 

is  15  dB  where  SNR(dB)  —  10  1  og  ( )  .  It  is  obvious  in  this 

on 

example  that  WBurgO  has  the  smallest  frequency  errors  as  well  as 
the  highest  resolution.  This  has  been  the  case  in  the  great 
majority  of  the  examples  that  we  have  run  thus  far.  Except 
for  a  few  cases  the  bias  of  WBurg  optimum  has  been  lower  than  Burq. 

The  resolution  of  WBurg  Hamming,  however,  was  found  to  be  consistently 
poorer  than  Burg  and  WBurgO.  Figure  (VI. 2)  shows  the  spectral 
estimate  of  a  real  sinusoid  with  a  large  SNR.  It  can  be  seen  that 
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WBurg  optimum  taper  has  reduced  the  frequency  error  and  prevented 
peak  splitting  which  is  shown  by  the  untapered  Burg  method  of 
order  4.  This  again  was  accomplished  without  sacrificing 
resolution . 

Figure  (VI. 3)  is  a  plot  of  the  maximum  frequency  error  for 

each  method  for  a  unit  amplitude  real  sinusoid  at  angle  (frequency) 

of  6  =  r/5  as  a  function  of  the  number  of  data  samples.  The 

maximum  frequency  errors  were  found  by  solving  for  the  pole 

locations  over  the  range  of  values  for  the  sinusoid  phase  <J> . 

2 

The  noise  variance  is  a  =  0.025.  It  should  be  noted  that  the 

n 

optimum  taper  was  derived  on  the  basis  of  minimum  average  f requency 
variance .  This  simulation  shows  comparative  error  reduction  for 
the  maximum  frequency  error  only  for  0  =  tt/5  between  the  optimum 
and  Hamming  tapers.  As  was  pointed  out  earlier,  however,  all 
of  our  simulations  have  shown  better  resolution  for  the  optimum 
taper  compared  to  the  Hamming  one.  It  is  also  interesting  to  note 
that  the  main  reduction  in  frequency  error  is  apparently  obtained 
by  tapering  the  first  order  residual  energy  only.  This  has  also 
been  borne  out  by  the  majority  of  our  simulations. 
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DATA  LENGTH 

Maximum  errors  in  the  estimated  frequency  of  a  real 
sinusoid  as  a  function  of  data  length. 


VII.  SUMMARY  AND  CONCLUSIONS 


Data-adaptive  spectral  estimation  methods  have  found 
widespread  application  in  recent  years,  notably  in  radar  signal 
processing.  The  work  summarized  in  this  report  was  concerned 
with  two  such  methods:  the  autoregressive  (AR)  and  the  auto- 
regressive-moving  average  (ARMA)  spectral  estimators.  Our 
main  concern  in  this  investigation  was  the  resolving  capabilities 
of  the  different  models  in  the  presence  of  noise  and  the 
statistical  properties  of  some  new  spectral  estimators. 

Since  the  AR  methods  are  by  far  the  most  popular,  we 
postulated  an  AR  noiseless  signal  model.  The  resulting  appropriate 
model  in  the  presence  of  noise  was  then  shown  to  be  ARMA.  To 
investigate  the  degradation  in  resolution  in  the  spectrum  of 
noisy  signals,  we  also  used  a  parallel  resonator  model  for  the 
signal.  This  model  was  also  shown  to  be  equivalent  to  an  ARMA 
one.  The  resonator  model,  however,  made  it  possible  to  evaluate 
the  loss  in  resolution  as  a  function  of  signal- to-noise  ratio  and 
signal  parameters . 

Since  an  ARMA  model  may  be  approximated  by  a  high-order 
AR  one,  attention  was  next  focussed  on  the  development  of  a 
robust  method  for  identifying  the  order  of  an  AR  model.  A  method 
closely  related  to  that  of  Akaike's  information  criterion  was 
developed.  This  method  proved  to  be  more  stable  than  the 
minimum  AIC. 

ARMA  spectral  estimation  was  treated  extensively  in  this  work. 
The  generality  of  the  model  was  important  from  two  distinct  points 
of  view:  i)  as  the  appropriate  model  for  noisy  AR  signals,  thus 
to  improve  resolution;  ii)  as  a  resonable  model  for  signals  with 


spectra,  narrow  or  wide,  possessing  deep  nulls  and/or  sharp 
roll-offs.  Because  of  the  computational  complexities  of  the 
optimum  (maximum  likelihood)  method  of  estimating  the  ARMA 
parameters,  several  sub-optimum,  computationally  efficient 
techniques  were  devised.  These  methods  behaved  reasonably  veil 
for  moderate  to  large  data  samples,  but  were  inferior  to  the 
optimum  one,  as  expected,  in  terms  of  statistical  efficiency. 
The  efficiency  of  various  classes  of  ARMA  spectral  estimators 
are  currently  under  further  inves tigation  by  Monte  Carlo 
simulations  and  will  be  reported  on  in  the  future. 

The  maximum  entropy  method  (MEM)  of  spectral  estimation, 
using  Burg's  technique,  has  been  very  popular  in  estimating 
the  spectra  of  sinusoidal  type  signals.  It  has  been  known, 
however,  that  the  accuracy  of  such  spectral  estimates  is 
significantly  influenced  by  the  phase  of  the  sinusoids.  Based 
on  some  recent  results,  we  developed  an  optimum  taper  for  the 
residual  eneraies  in  the  Bure  MEM  technique.  It  was  shown  that 
the  use  of  such  a  taper  substantially  reduced  the  sensitivity 
of  the  spectral  peaks  to  the  sinusoids'  phases  and  markedly 
reduced  the  occurrence  of  line-splitting  in  the  usual  MEM 
estimates.  It  is  remarkable  that  these  improvements  were  made 
without  sacrifice  in  the  resolution  of  the  spectral  estimates. 
Work  is  continuing  to  more  fully  explain  the  effect  of  such  a 
taper  and  its  ramifications  in  Burg-type  parameter  estimation 
techniques . 
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APPENDIX  A 


Equation  (11.25)  is 


r  (0)r  (2)  -  (1) 

2  m _ 2_m _ 1  m 

r2  (0)  -  r2  (1) 
y  V 

2  m  1  m 


(A.l) 


Now  substitute  for  r  (0),  r  (2)  and  r  (1)  in  A.l, 


(amo°uacos  V°w'  >  (amoaua4cos  (2wm'^m)  (aL°uaprncos  1  2 

2  2 
2  2  2  2  2  ^ 

( a,  acos4>  +a  ,  )  -(a  a  a,o  cos  (w  -d>  )) 
mo  u  w  mo  u  m  m  ; 


(A. 2) 


Expanding  the  numerator  and  denominator  of  equation  A. 2,  we  aet, 


22  22  2222 

a  P  cos$cos(2w  -6  )  +a  o  a  ,a r,  cos(2w  -4>  ) 
mo  u  m  m  rnr  mo  u  w  m  m  m 


2  2  2  7  2 

-  (a  o,  )  a  o  cos  (w  -g  ) 
mo  u  m  m  m 

2 

22  2  2  4  222 

(a  a  J  a  cos  +c  ,  t  +  2a  a  a  ,acos<J> 
mo  u  m  w’  mo  u  w '  vm 


2  2  2  7  2 

-  (a  oj  ct  o  cos  (w  -<j>  ) 
mo  n  m  m  m 


(A. 3) 


Define  the  s ignal- to-noise  ratio  £  as  , 
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ry  (0>  "  °w’ 
2  m 


r  (0) 
2  m 


where , 


=  Y  -  1 


2  2 

r  (0)  =  a  o  acos4> 
y  mo  u  m 

2  m 


Dividing  the  numerator  and  denominator  of  A. 3  by  ,  and  using 


(A. 4) 


A.  4 ,  we  ge t , 


2  COS(2wnfV 


cos  (2w  -<j>  ) 
m  Tm' 

+  Y  - r -  -  Y 

1  cosd  1 

Ym 


2  cos  ^ 2 

cos2J> 

rr.  ’ 


2  2  . 

_  P^cos  w  -<i  ) 

,  2  .  ,  .  „  2  m  m  m 

Y  +  1  +  2y  -  y  - = - 

cosset 

m 


(A. 5) 


Rearranging  terms  in  A. 5,  we  have 


Y2  (coa^cos  (2wra-$ra)  -cos2  (wm-4,m) )  +Ycos4)mcos  (2*^) 

2  2  2  1  2  2 
Y  cos  <f>  -p  cos  (w  -o  )  -f  2  ycos  <p  +cos  0 
m  m  nr  '  rm  m 


(A. 6) 


As  y 


*  i  and  the  left-hand  side  of  equation  A.  6  is, 


I'W  I 
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APPENDIX  B 


In  this  appendix  we  derive  a  closed  form  expression  for 
the  solution  of  the  following  equations: 


'2  - 1  0  ...  0  ; 

W1 

Y 

-1  2-1  0  ...  0 

• 

0  -1  2-1  ...  0 

= 

. 

0  ...  0  -1  2 

W..  , 

A 

J 

NJ 

(B.l) 


where  A  is  the  normalization  factor  for  W. 

A  recursive  algorithm  for  a  set  of  linear  equations  with 
a  general  tridiagonal  coefficient  matrix  is  given  in  [30].  We 
use  the  notation  used  in  this  reference  to  derive  our  closed 
form  solution. 

The  solution  for  w^  =  w^  is  given  by 


gN 

wM  =  w,  =  — 
N  1  aN 

where  according  to  [30] 


(B.  2) 


(k-1)  ,  , 

^  k  ^k-1  '  ^ 


°t k  “  2  +  Ck  ,  k=2 , 


.  /  N 


=  2 


6k  "  a 


k-1 


/  N 


(B .  3) 


We  first  derive  an  expression  for  6^.  Let  8^  =  -Vj where 
and  6^  are  integers. 

From  (B . 3) 

^k  2  +  «  '  k=2,  ’•*  ,N  (B.4) 

k-  1 
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Then 


^k  vk-l 


uk-l 

2<5k-l  ~  vk- 1 


(B .  5 ) 


Now  let  Sk_^  =  vk-l  +  mk-l  then 


Jk-1 


^k  ^k-1  +  mk-l 


(B.6) 


But  we  have  -  1/  ^2  ”  2  and  m2  =  ^  anc*  ^rom  it  is 

obvious  that  mk  -  constant.  Therefore, 


\  =  6k-l 
6k  5k-l+l 


or  with  =  1  ,  S  2  ~  2 


ek  - 


(k-l) 


(B.7) 


ak  now  simply  follows  as 


k  +  l 

ctk  =  2+0k  =  ,  k=l ,  ...  ,  N 


(B  .8) 


Let  hk  =  kgk  and  substitute  for  qk  in  (B.3).  hk  then  satisfies 
the  difference  equation 


hk  ‘  hk-l  =  kX  '  ho  =  0 


(B .  9 ) 


Z-transform  (B.9)  to  get 


H(z)  = 


z  ,\ 


z  1  (z-1)2 


(B. 10) 


resulting  in 
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k 

h,  =  (step) * (ramp) X  =  A  I  j 

k  j=i 

_  k(k+l)  , 

- 2 -  A 


Therefore 


k+1 

2 


(B.ll) 


( B  .  1 2  ) 


Substitute  in  (B.2)  to  g^t 


w 


N 


w. 


NX 

TT 


( B  .  1  3  ) 


Using  (B.13)  as  the  initial  condition  we  can  now  solve  for 
other  w. 's,  noting  that 


wk  -  2  4-  wk^2 


k=2  , 


t  N 


W  a  0 
O 


W,  = 


NX 

2 


(B.14) 


2-transform  of  (B.14)  results  in 


W(z) 


-z3X  (N+2 ) z  X  Xz2 
(z-1)3  2  ( z- 1 )  2  (z-1)2 


(B.15) 


Taking  the  inverse  transform  of  (B.15)  gives 


wR  =  _  Uk±llSk  +  2l  +  (N±2)kA  +  (k+1)X 


or 


„  _  k (N+l-k ) 

wk - 2 - 


The  normalization  factor  X  can  now  be  found  as 


(B. 16) 
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N 

(  I 
k=l 


12 

n  (N+n  tn+tt 


(B • 17 ) 


Alternatively, 


XN+1  AN 


N 

N+3 


(B . 18 ) 
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